Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding

نویسندگان

چکیده

Global and local relational reasoning enable scene understanding models to perform human-like analysis understanding. Scene enables better semantic segmentation object-to-object interaction detection. In the medical domain, a robust surgical model allows automation of skill evaluation, real-time monitoring surgeon’s performance post-surgical analysis. This letter introduces globally-reasoned multi-task capable performing instrument tool-tissue Here, we incorporate global in latent space introduce multi-scale (neighborhood) coordinate improve segmentation. Utilizing setup, visual-semantic graph attention network detection is further enhanced through reasoning. The features from module are introduced into network, allowing it detect interactions based on both node-to-node Our reduces computation cost compared running two independent single-task by sharing common modules, which indispensable for practical applications. Using sequential optimization technique, proposed outperforms other state-of-the-art MICCAI endoscopic vision challenge 2018 dataset. Additionally, also observe when trained using knowledge distillation technique. official code implementation made available GitHub.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Dirty Model for Multi-task Learning

We consider multi-task learning in the setting of multiple linear regression, and where some relevant features could be shared across the tasks. Recent research has studied the use of l1/lq norm block-regularizations with q > 1 for such blocksparse structured problems, establishing strong guarantees on recovery even under high-dimensional scaling where the number of features scale with the numb...

متن کامل

Multi-Modal Scene Understanding for Robotic Grasping

Current robotics research is largely driven by the vision of creating an intelligent being that can perform dangerous, difficult or unpopular tasks. These can for example be exploring the surface of planet mars or the bottom of the ocean, maintaining a furnace or assembling a car. They can also be more mundane such as cleaning an apartment or fetching groceries. This vision has been pursued sin...

متن کامل

Multi-Task Learning for Spoken Language Understanding with Shared Slots

This paper addresses the problem of learning multiple spoken language understanding (SLU) tasks that have overlapping sets of slots. In such a scenario, it is possible to achieve better slot filling performance by learning multiple tasks simultaneously, as opposed to learning them independently. We focus on presenting a number of simple multi-task learning algorithms for slot filling systems ba...

متن کامل

Robust Learning and Segmentation for Scene Understanding

This thesis demonstrates methods useful in learning to understand images from only a few examples, but they are by no means limited to this application. Boosting techniques are popular because they learn effective classification functions and identify the most relevant features at the same time. However, in general, they overfit and perform poorly on data sets that contain many features, but fe...

متن کامل

Global structured models towards scene understanding

Many scene understanding tasks are formulated as a labelling problem that tries to assign a label to each pixel of an image. These discrete labels may vary depending on the task, for example they may correspond to di erent object classes such as car, grass or sky, or to depths or to intensity after denoising. These labelling problems are typically formulated as a pairwise Markov or Conditional ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE robotics and automation letters

سال: 2022

ISSN: ['2377-3766']

DOI: https://doi.org/10.1109/lra.2022.3146544